WHAT IS CLAIMED IS:

1. A method for analyzing gene expression data obtained from a plurality
of microarrays having a plurality of probes, wherein the plurality of probes includes
mismatch (MM) probe pairs having a mismatch value and perfect match (PM) probe
pairs having a perfect match value, the method comprising the steps of:

obtaining image data corresponding to scanned microarrays, the image data
for each scanned microarray comprising an image corresponding to the scanned probe
intensities, scan date, and at least one chip identifier;

storing the image data for each scanned microarray in at least one database;

applying an automated quality control process, comprising the steps of:

in a processor, processing the image data by applying at least a portion
of a plurality of image processing metrics comprising algorithms adapted to
identify one or more defects selected from the group consisting of haze, bright
artifacts, dim artifacts, crop circles, snow, snow, misalignment, grid
misalignment, high background intensity, saturation, scratches, cracks;

flagging any identified defects;

assigning a pass/fail status to each microarray based upon identified
defects, if any;

storing the processed image data in the at least one database, the processed
image data comprising the scanned probe intensities, the scan date, the at least one
chip identifier, the pass/fail status, the applied image processing metrics, and the
identified defects, if any;

providing a user interface for searching the at least one database by selecting
at least one chip parameter from the group consisting of scan date, the at least one
chip identifier, the pass/fail status and the plurality of image processing metrics; and

displaying the results of the search.
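
By way of illustration only, the storing and searching steps of claim 1 might be sketched as follows. The table name `chip_qc`, its columns, the sample records, and the helper `search_chips` are hypothetical and do not appear in the claims; an in-memory SQLite database merely stands in for the "at least one database".

```python
import sqlite3

# Hypothetical schema: one row per scanned microarray (illustrative sample data).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE chip_qc ("
    "chip_id TEXT PRIMARY KEY, scan_date TEXT, status TEXT, defects TEXT)"
)
conn.executemany(
    "INSERT INTO chip_qc VALUES (?, ?, ?, ?)",
    [
        ("CHIP-001", "2006-03-01", "pass", ""),
        ("CHIP-002", "2006-03-01", "fail", "haze,scratches"),
        ("CHIP-003", "2006-03-02", "pass", ""),
    ],
)


def search_chips(conn, status=None, scan_date=None):
    """Return rows matching whichever chip parameters the user selected."""
    clauses, params = [], []
    if status is not None:
        clauses.append("status = ?")
        params.append(status)
    if scan_date is not None:
        clauses.append("scan_date = ?")
        params.append(scan_date)
    sql = "SELECT chip_id, scan_date, status, defects FROM chip_qc"
    if clauses:
        sql += " WHERE " + " AND ".join(clauses)
    return conn.execute(sql + " ORDER BY chip_id", params).fetchall()


failed = search_chips(conn, status="fail")
same_day = search_chips(conn, scan_date="2006-03-01")
```

Parameterized queries are used so that the selected chip parameters are passed as values rather than spliced into the SQL text.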

2. The method of claim 1, wherein the step of processing the image data 
further comprises applying a mask to exclude data corresponding to defects.

3. The method of claim 1, wherein an image processing metric of the
plurality comprises counting the MM probe pairs and PM probe pairs and flagging a
microarray as dim if the number of MM probe pairs is greater than the number of PM
probe pairs.
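
As a non-limiting sketch of the counting metric of claim 3, the example below assumes that an "MM probe pair" is a pair whose mismatch intensity exceeds its perfect match intensity, and a "PM probe pair" the reverse. That reading, the function name `is_dim`, and the sample intensities are assumptions made for illustration.

```python
from typing import Sequence


def is_dim(pm: Sequence[float], mm: Sequence[float]) -> bool:
    """Flag an array as dim when more pairs have MM > PM than PM > MM."""
    if len(pm) != len(mm):
        raise ValueError("pm and mm must have the same length")
    mm_pairs = sum(1 for p, m in zip(pm, mm) if m > p)  # mismatch dominates
    pm_pairs = sum(1 for p, m in zip(pm, mm) if p > m)  # perfect match dominates
    return mm_pairs > pm_pairs


# Illustrative intensities only.
normal_chip = is_dim([900.0, 750.0, 1200.0, 300.0], [400.0, 800.0, 500.0, 250.0])
dim_chip = is_dim([110.0, 95.0, 102.0, 99.0], [120.0, 101.0, 100.0, 104.0])
```

Ties (PM equal to MM) are counted toward neither group in this sketch.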

4. The method of claim 1, wherein an image processing metric of the 
plurality comprises determining a normalized background variance by estimating a
local background intensity and its spatial variation at a plurality of locations on the
microarray corresponding to PM probes.
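
One possible reading of the normalized background variance of claim 4 is sketched below. The claim does not fix the local background estimator or the normalization, so this example assumes a tile-wise minimum as the local estimate and divides the variance of those estimates by the square of their mean. The function names and the tile size are hypothetical.

```python
import statistics
from typing import List


def local_backgrounds(pm_grid: List[List[float]], tile: int) -> List[float]:
    """Assumed estimator: one background value per tile, taken as the tile minimum."""
    rows, cols = len(pm_grid), len(pm_grid[0])
    estimates = []
    for r0 in range(0, rows, tile):
        for c0 in range(0, cols, tile):
            cells = [
                pm_grid[r][c]
                for r in range(r0, min(r0 + tile, rows))
                for c in range(c0, min(c0 + tile, cols))
            ]
            estimates.append(min(cells))
    return estimates


def normalized_background_variance(pm_grid: List[List[float]], tile: int = 2) -> float:
    """Spatial variance of the local backgrounds, scaled by their squared mean."""
    bg = local_backgrounds(pm_grid, tile)
    mean = statistics.fmean(bg)
    return statistics.pvariance(bg) / (mean * mean)


even = [
    [100.0, 150.0, 100.0, 160.0],
    [140.0, 130.0, 120.0, 110.0],
    [100.0, 180.0, 100.0, 170.0],
    [150.0, 160.0, 140.0, 130.0],
]
# Same chip with an elevated background in the bottom-right tile.
uneven = [row[:] for row in even]
for r in (2, 3):
    for c in (2, 3):
        uneven[r][c] += 200.0

even_score = normalized_background_variance(even)
uneven_score = normalized_background_variance(uneven)
```

A spatially uniform background gives a score of zero, and the score grows as the local estimates diverge from one another.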

5. The method of claim 1, wherein an image processing metric of the
plurality comprises:

estimating a local background intensity at a plurality of locations on the
microarray corresponding to perfect match (PM) probe pairs;

dividing the microarray into inner and outer portions;

determining a mean background intensity for each of the inner and outer
portions; and

using a ratio of mean background intensities of the outer and inner portions to
flag crop circles.
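
The steps of claim 5 might be sketched as follows, taking a grid of already-estimated local background values as input. The rectangular split into an outer frame and an inner block, the `border` width, and the `tolerance` used for flagging are assumptions; the claim states only that the outer-to-inner ratio is used to flag crop circles.

```python
from typing import List


def crop_circle_ratio(background: List[List[float]], border: int) -> float:
    """Mean background of the outer frame divided by that of the inner block."""
    rows, cols = len(background), len(background[0])
    inner, outer = [], []
    for r in range(rows):
        for c in range(cols):
            in_inner = border <= r < rows - border and border <= c < cols - border
            (inner if in_inner else outer).append(background[r][c])
    if not inner or not outer:
        raise ValueError("border leaves an empty inner or outer portion")
    return (sum(outer) / len(outer)) / (sum(inner) / len(inner))


def flag_crop_circle(background: List[List[float]], border: int = 1,
                     tolerance: float = 1.5) -> bool:
    """Hypothetical rule: flag when the ratio departs from 1 in either direction."""
    ratio = crop_circle_ratio(background, border)
    return ratio > tolerance or ratio < 1.0 / tolerance


flat = [[100.0] * 4 for _ in range(4)]
ringed = [[100.0] * 4 for _ in range(4)]
for r in (1, 2):
    for c in (1, 2):
        ringed[r][c] = 250.0  # brighter centre than edge
```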

6. The method of claim 1, wherein an image processing metric of the 
plurality comprises: 

generating a model using the PM probe pairs from a set of microarrays; 
applying the model to each microarray to be analyzed to determine a weights 
factor for each probe on the microarray.

7. The method of claim 1, wherein the defect is haze and the image
processing metric is selected from the group consisting of vertical 10th percentile peak
to median ratio, maximum/minimum ratio for horizontal 25th percentile profile, two
edge ratios for horizontal 25th percentile profile, two edge ratios for vertical 25th
percentile profile, maximum/minimum ratio for horizontal 75th percentile profile, and
two edge ratios for horizontal 75th percentile profile.

8. The method of claim 1, wherein the defect is bright artifacts and the
image processing metric is selected from the group consisting of oligo B2 mean
intensity, spike-in offset, spike-in coefficient of determination, two edge ratios for
horizontal 25th percentile profile, two edge ratios for vertical 25th percentile profile,
two edge ratios for horizontal 75th percentile profile, probe pair difference outlier
horizontal variance, vertical probe pair difference outlier, negative probe pair
horizontal and vertical variance, negative probe pair horizontal and vertical
maximum/minimum ratio, local background normalized variance.

9. The method of claim 1, wherein an image processing metric of the 
plurality comprises a ratio of the natural log of a mean intensity of non-control
oligonucleotides to the natural log of an image fifth percentile.
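
The ratio recited in claim 9 can be illustrated with the short sketch below. The nearest-rank percentile definition and the function names are assumptions, since the claim does not specify how the image fifth percentile is computed.

```python
import math
from typing import Sequence


def fifth_percentile(values: Sequence[float]) -> float:
    """Nearest-rank 5th percentile (an assumed definition)."""
    ordered = sorted(values)
    rank = max(1, -(-5 * len(ordered) // 100))  # ceil(0.05 * n) in integer arithmetic
    return ordered[rank - 1]


def log_intensity_ratio(non_control: Sequence[float],
                        image_pixels: Sequence[float]) -> float:
    """ln(mean non-control intensity) divided by ln(image 5th percentile)."""
    mean_intensity = sum(non_control) / len(non_control)
    return math.log(mean_intensity) / math.log(fifth_percentile(image_pixels))


# Illustrative values: mean non-control intensity 1000, image 5th percentile 10.
ratio = log_intensity_ratio([500.0, 1500.0, 1000.0], [10.0] + [50.0] * 19)
```

Both intensities must exceed 1 for the two logarithms to be positive, and the fifth percentile must differ from 1 so that the denominator is nonzero.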

10. An automated system for analyzing gene expression data obtained 
from a plurality of chips having a plurality of probes, wherein the plurality of probes 
includes mismatch (MM) probe pairs having a mismatch value and perfect match 
(PM) probe pairs having a perfect match value, the system comprising: 

a database for storing image data for a plurality of scanned chips comprising
an image corresponding to scanned probe intensities and a plurality of chip
parameters corresponding to the scanned chip, wherein the chip parameters are
selected from a group consisting of scan date, chip type, lot number, image processing
metrics, and pass/fail status;

a user interface for receiving a user query comprising at least one chip
parameter and for displaying information responsive to the query;

a processor for processing the image data for quality control by applying at
least one of a plurality of image processing metrics adapted to identify defects
selected from the group consisting of haze, bright artifacts, dim artifacts, crop circles,
snow, snow, misalignment, grid misalignment, high background intensity, saturation,
scratches, cracks, and for searching the database for records corresponding to the
selected at least one chip parameter.

11. The system of claim 10, wherein the processor is further operable to
apply a mask to exclude data corresponding to defects. 

12. The system of claim 10, wherein an image processing metric of the
plurality comprises counting the MM probe pairs and PM probe pairs and flagging a
microarray as dim if the number of MM probe pairs is greater than the number of PM
probe pairs.

13. The system of claim 10, wherein an image processing metric of the 
plurality comprises determining a normalized background variance by estimating a
local background intensity and its spatial variation at a plurality of locations on the 
microarray corresponding to PM probes. 

14. The system of claim 10, wherein an image processing metric of the
plurality comprises:

estimating a local background intensity at a plurality of locations on the
microarray corresponding to perfect match (PM) probe pairs;

dividing the microarray into inner and outer portions;

determining a mean background intensity for each of the inner and outer
portions; and

using a ratio of mean background intensities of the outer and inner portions to
flag crop circles.

15. The system of claim 10, wherein an image processing metric of the
plurality comprises:

generating a model using the PM probe pairs from a set of microarrays; 
applying the model to each microarray to be analyzed to determine a weights 
factor for each probe on the microarray. 

16. The system of claim 10, wherein the defect is haze and the image
processing metric is selected from the group consisting of vertical 10th percentile peak
to median ratio, maximum/minimum ratio for horizontal 25th percentile profile, two
edge ratios for horizontal 25th percentile profile, two edge ratios for vertical 25th
percentile profile, maximum/minimum ratio for horizontal 75th percentile profile, and
two edge ratios for horizontal 75th percentile profile.

17. The system of claim 10, wherein the defect is bright artifacts and the
image processing metric is selected from the group consisting of oligo B2 mean
intensity, spike-in offset, spike-in coefficient of determination, two edge ratios for
horizontal 25th percentile profile, two edge ratios for vertical 25th percentile profile,
two edge ratios for horizontal 75th percentile profile, probe pair difference outlier
horizontal variance, vertical probe pair difference outlier, negative probe pair
horizontal and vertical variance, negative probe pair horizontal and vertical
maximum/minimum ratio, local background normalized variance.

18. The system of claim 10, wherein an image processing metric of the 
plurality comprises a ratio of the natural log of a mean intensity of non-control
oligonucleotides to the natural log of an image fifth percentile.

19. A method for determining quality of a microarray comprising a
plurality of probes including PM probes, the method comprising:

in a set of microarrays comprising a plurality of transcripts, determining a
probe weight for each PM probe using RMA analysis; and

calculating a relative weight factor for each transcript by taking the inverse of
the square root of the sum of the probe weights for each PM probe;

wherein a higher relative weight factor value corresponds to a lower quality
microarray.
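
The relative weight factor of claim 19 reduces to a one-line computation, sketched below. The function name and the sample probe weights are hypothetical; the weights themselves are assumed to have been obtained beforehand from the RMA analysis.

```python
import math
from typing import Dict, Sequence


def relative_weight_factor(probe_weights: Sequence[float]) -> float:
    """Inverse of the square root of the sum of a transcript's PM probe weights."""
    total = sum(probe_weights)
    if total <= 0:
        raise ValueError("sum of probe weights must be positive")
    return 1.0 / math.sqrt(total)


# Illustrative weights for two transcripts on one microarray.
weights_by_transcript: Dict[str, Sequence[float]] = {
    "transcript_a": [1.0, 1.0, 1.0, 1.0],      # probes fully trusted
    "transcript_b": [0.25, 0.25, 0.25, 0.25],  # probes heavily down-weighted
}
factors = {name: relative_weight_factor(w) for name, w in weights_by_transcript.items()}
```

Down-weighted probes shrink the sum and therefore raise the factor, consistent with a higher value corresponding to a lower quality microarray.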

20. The method of claim 19, wherein RMA analysis comprises: 
background correcting a PM probe value for each PM probe;

log2 transforming the background-corrected PM probe values;

quantile normalizing the log2 transformed probe values for the set of
microarrays; and

applying median polish to the quantile normalized values to obtain the probe
weight for each PM probe.
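
The four steps of claim 20 might be sketched as follows for the PM probes of a single transcript (rows are probes, columns are microarrays). This is a simplified illustration, not a full RMA implementation: the background correction is a stand-in that subtracts each array's minimum, ties are ignored in the quantile normalization, and the median polish omits a separate overall term. The claim does not state how a probe weight is read off the median polish fit, so the sketch stops at the probe effects, array effects, and residuals. All function names are hypothetical.

```python
import math
import statistics
from typing import List, Tuple

Matrix = List[List[float]]  # rows = PM probes, columns = microarrays


def background_correct(pm: Matrix, floor: float = 1.0) -> Matrix:
    """Stand-in correction: subtract each array's minimum, clamped at `floor`."""
    n_arrays = len(pm[0])
    col_min = [min(row[a] for row in pm) for a in range(n_arrays)]
    return [[max(row[a] - col_min[a], floor) for a in range(n_arrays)] for row in pm]


def log2_transform(m: Matrix) -> Matrix:
    return [[math.log2(v) for v in row] for row in m]


def quantile_normalize(m: Matrix) -> Matrix:
    """Give every array the same distribution: the mean of the sorted columns."""
    n_probes, n_arrays = len(m), len(m[0])
    sorted_cols = [sorted(m[p][a] for p in range(n_probes)) for a in range(n_arrays)]
    rank_means = [statistics.fmean([col[r] for col in sorted_cols])
                  for r in range(n_probes)]
    out = [[0.0] * n_arrays for _ in range(n_probes)]
    for a in range(n_arrays):
        order = sorted(range(n_probes), key=lambda p: m[p][a])
        for rank, p in enumerate(order):
            out[p][a] = rank_means[rank]
    return out


def median_polish(m: Matrix, n_iter: int = 10) -> Tuple[List[float], List[float], Matrix]:
    """Alternately sweep row and column medians out of the residuals."""
    resid = [row[:] for row in m]
    n_rows, n_cols = len(m), len(m[0])
    probe_eff, array_eff = [0.0] * n_rows, [0.0] * n_cols
    for _ in range(n_iter):
        for i in range(n_rows):
            med = statistics.median(resid[i])
            probe_eff[i] += med
            resid[i] = [v - med for v in resid[i]]
        for j in range(n_cols):
            med = statistics.median([resid[i][j] for i in range(n_rows)])
            array_eff[j] += med
            for i in range(n_rows):
                resid[i][j] -= med
    return probe_eff, array_eff, resid


# Illustrative PM intensities: 4 probes measured on 3 microarrays.
pm = [[120.0, 130.0, 125.0], [220.0, 260.0, 240.0],
      [520.0, 500.0, 560.0], [90.0, 100.0, 95.0]]
normalized = quantile_normalize(log2_transform(background_correct(pm)))
probe_eff, array_eff, resid = median_polish(normalized)
```

In this sketch each normalized value equals its probe effect plus its array effect plus its residual; large residuals mark probes that the additive fit explains poorly.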

21. A method for determining quality of a microarray comprising a
plurality of probes including PM probes, the method comprising:

in a set of microarrays comprising a plurality of transcripts, determining a
probe weight for each PM probe using RMA analysis; and

calculating a quality metric according to the relationship:

$$w = \frac{1}{\sqrt{\sum_{j} w_{j}}},$$ where $w_j$ is the weight of the PM probe.

22. The method of claim 21, wherein RMA analysis comprises: 
background correcting a PM probe value for each PM probe;

log2 transforming the background-corrected PM probe values;

quantile normalizing the log2 transformed probe values for the set of
microarrays; and

applying median polish to the quantile normalized values to obtain the probe
weight for each PM probe.

23. A method for determining quality of a microarray comprising a 
plurality of probes including PM probes, the method comprising:

in a set of microarrays comprising a plurality of transcripts, determining a
probe weight for each PM probe using RMA analysis; and

calculating a quality metric according to the relationship: 

$$\sum_{j} \sum_{k} w_{jk}^{x},$$ where $w_{jk}$ is the weight of the $j$th PM probe of transcript $k$,
and $x = 1$ or $2$.

24. The method of claim 23, wherein RMA analysis comprises: 
background correcting a PM probe value for each PM probe; 

log2 transforming the background-corrected PM probe values;

quantile normalizing the log2 transformed probe values for the set of
microarrays; and

applying median polish to the quantile normalized values to obtain the probe 
weight for each PM probe. 

25. A method for determining quality of a microarray comprising a 
plurality of probes including PM probes, the method comprising: 

in a set of microarrays comprising a plurality of transcripts, determining a
probe weight for each PM probe using RMA analysis; and

calculating a quality metric according to the relationship: 
$$\sum_{j} \sum_{k} [\ldots],$$ where $w_{jk}$ is the weight of the PM probe of transcript $k$.

26. The method of claim 25, wherein RMA analysis comprises: 
background correcting a PM probe value for each PM probe;

log2 transforming the background-corrected PM probe values;

quantile normalizing the log2 transformed probe values for the set of
microarrays; and

applying median polish to the quantile normalized values to obtain the probe
weight for each PM probe.